CLAIMS 

What is claimed is: 



1 1 . A method for managing caches in a system with multiple caches that may contain 

2 different copies of a data item, comprising the steps of: 

3 modifying the data item in a first node of said multiple caches to create a modified 

4 data item; 

5 sending the modified data item from said first node to a second node of said multiple 

6 caches without durably storing the modified data item from said first node to 
O 7 persistent storage; 

yp 8 after said modified data item has been sent from said first node to said second node, 
ffl 9 said first node sending a request to a master of said data item for writing said 

|p 10 data item to persistent storage; and 

f% 11 in response to said request, said master coordinating with said multiple caches to 

h s 

P| 12 cause said data item to be written to persistent storage. 



fy 1 2. The method of Claim 1 wherein: 

2 the method includes the step of maintaining, within an ordered series of bins, entries 

3 for past-image versions of data items; 

4 each bin in said ordered series corresponds to a time range; 

5 a particular bin corresponds to the time range that covers the time at which the data 

6 item is modified in said first node; and 

7 the step of sending a request is performed by sending a request for writing a particular 

8 bin of said ordered series of bins to persistent storage. 

1 3. The method of Claim 2 wherein the step of said master coordinating with said 

2 multiple caches to cause said data item to be written to persistent storage includes 

3 said master causing said multiple caches to write data items to persistent storage to 
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cover all past image versions of data items that were modified during the time range 
of said particular bin. 

4. The method of Claim 3 further comprising the step of emptying said particular bin 
after said multiple caches write data items to persistent storage to cover all past image 
versions of data items that were modified during the time range of said particular bin. 

5. The method of Claim 4 wherein the step of emptying said particular bin includes the 
steps of: 

discarding entries within said particular bin that are associated with past images that 
have last-dirtied times within the time range of said particular bin; and 

moving to one or more other bins the entries within said particular bin that are 

associated with past images that have last-dirtied times later than the time 
range of said particular bin. 

6. The method of Claim 1 wherein the step of sending a request to a master is performed 
by sending the request to a global lock manager. 

7. The method of Claim 1 wherein the step of sending a request to a master is performed 
by sending the request to a lock manager that is one of a plurality of lock managers 
within a distributed lock management system. 

8. The method of Claim 1 further comprising the step of sending from the master, to 
interested nodes, write-notification messages indicating that said data item has been 
written to persistent storage, in response to said data item being written to persistent 
storage. 
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9. The method of Claim 8 wherein the step of sending write-notification messages 
includes the master sending to at least one interested node a single message that 
notifies said at least one interested node that a plurality of data items have been 
written to persistent storage. 

10. The method of Claim 1 wherein the step of said first node sending a request to a 
master of said data item for writing said data item to persistent storage includes the 
first node sending to said master a single message that requests writing a plurality of 
data items to persistent storage, wherein said plurality of data items includes said data 
item. 

1 1 . The method of Claim 10 wherein the step of sending a single message includes 
sending a message that identifies a bin to request that all data items that belong to the 
bin be written to persistent storage. 

12. The method of Claim 1 1 wherein the bin is associated with a range of time and 
includes data items that were first dirtied by the first node during said range of time 
and that were subsequently transferred to other nodes without first being written to 
persistent storage. 

1 3 . The method of Claim 8 wherein the step of sending from the master to interested 
nodes write-notification messages includes the steps of: 

immediately sending write-notification messages to a first set of interested nodes, 

where said first set of interested nodes includes the interested nodes that have 
requested said data item to be written to persistent storage; and 
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delaying the sending of write-notification messages to a second set of nodes, where 
said second set of nodes includes interested nodes that do not belong to said 
first set of interested nodes. 

14. The method of Claim 8 wherein the step of sending from the master to interested 
nodes includes delaying the sending of write-notification messages to at least one 
interested node. 

15. The method of Claim 14 wherein a write-notification message is sent to the at least 
one interested node in response to a lock request made by said at least one interested 
node. 

1 6. The method of Claim 1 4 wherein a write-notification message is sent to the at least 
one interested node in response to the at least one interested node requesting that said 
data item be written to persistent storage. 

17. The method of Claim 14 wherein a write-notification message is sent to the at least 
one interested node within a ping request that the master sends to the at least one 
interested node for the at least one interested node to transfer another data item to 
another node. 

18. The method of Claim 1 wherein the step of coordinating includes the steps of: 
determining whether a version of said data item, that is at least as recent as said 

modified version, has already been written to persistent storage; and 
if a version of said data item that is at least as recent as said modified version has 
already been written to persistent storage, then sending a write-notification 
message from said master to notify said first node that a version of said data 
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7 item that is at least as recent as said modified version has already been written 

8 to persistent storage. 

1 19. The method of Claim 1 8 wherein the step of coordinating includes, if a version of 

2 said data item that is at least as recent as said modified version has not already been 

3 written to persistent storage, then sending a write-perform message from said master 

4 to grant permission for said modified version to be written to persistent storage. 

1 20. The method of Claim 1 wherein the step of coordinating includes the 
O 2 steps of: 

# 3 selecting a particular node of said multiple caches that has a particular version of said 

Hi 4 data item, wherein said particular version is at least as recent as the modified 

m 5 data item in said first node; and 

p 6 causing said particular version of said data item to be written from said particular 

23; 7 node to persistent storage. 

fij 1 21. The method of Claim 20 wherein the step of selecting a particular node includes 

2 selecting the node, of said multiple caches, that has a most recently modified version 

3 of said data item. 

1 22. The method of Claim 20 further comprising the step of the master informing the first 

2 node that said data item has been written to persistent storage in response to the 

3 master receiving confirmation that said particular version of said data item has been 

4 written to persistent storage. 

1 23 . The method of Claim 20 further comprising the step of the master informing a set of 

2 caches that said data item has been written to persistent storage in response to the 

3 master receiving confirmation that said particular version of said data item has been 
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4 written to persistent storage, wherein said set of caches includes caches, other than 

5 said particular node, that contain modified versions of said data item that are not more 

6 recent than said particular version. 

1 24. A method for managing caches in a system with multiple caches that may contain 

2 different copies of a data item, comprising the steps of: 

3 modifying the data item in a first cache to create a modified data item; 

4 in response to writing the modified data item to persistent storage, performing the 

5 steps of: 

6 a node associated with the first cache determining whether any other cache in 

7 said multiple caches had created a dirty version of said data item; and 

8 if any other cache in said multiple caches had created a dirty version of said 

9 data item, then the node associated with the first cache informing a 

1 0 master of said data item that said modified data item has been written 

11 to persistent storage; and 

12 if no other cache in said multiple caches had created a dirty version of said 

1 3 data item, then the step of writing the modified data item is performed 

14 without informing said master that said modified data item has been 

1 5 written to persistent storage. 

1 25 . The method of Claim 24 wherein the step of determining whether any other cache in 

2 said multiple caches had created a dirty version of said data item includes inspecting a 

3 global dirty flag associated with said data item. 

1 26. The method of Claim 25 wherein: 

2 prior to modifying the data item in said first cache, the data item had been modified in 

3 a second cache; 
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4 the data item is not persistently stored between being modified in said second cache 

5 and being modified in said first cache; and 

6 a node associated with the second cache causes said global dirty flag to be set to 

7 indicate that said data item is globally dirty. 

1 27. The method of Claim 26 wherein the node associated with the second cache causes 

2 said global dirty flag to be set in response to transferring said data item from said 

3 second cache to another cache of said multiple caches. 

1 28. A method for managing caches in a system with multiple caches that may contain 

2 different copies of a data item, comprising the steps of: 

3 modifying the data item in a first cache to create a modified data item; 

4 when a node associated with the first node desires to write said modified data item to 

5 persistent storage, performing the steps of: 

6 if then node associated with the first cache does not currently have ownership 

7 rights to said data item, then the node associated with the first cache 

8 sending a request to a master of said data item for said data item to be 

9 written to persistent storage; and 

1 0 if said node associated with said first cache currently has ownership rights to 

1 1 said data item, then the first node writing said data item to persistent 

12 storage without sending a request to said master for said data item to 

13 be written to persistent storage. 

1 28. The method of Claim 27 further comprising the step of designating holders of 

2 exclusive locks in data items to be owners of said data items. 

1 29. A method for managing a data item, the method comprising the steps of: 
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2 when a node that has an exclusive lock on a data item desires to write the data item to 

3 persistent storage, performing the steps of 

4 determining whether a mode associated with the data item is local or global; 

5 if the mode associated with the data item is local, then the node writing the 

6 data item to persistent storage without communicating with a master of 

7 said data item; and 

8 if the mode associated with the data item is global, then the node sending a 

9 message to the master of the data item to request writing of said data 
10 item to persistent storage. 



The method of Claim 29 wherein: 

the mode associated with the data item is global; and 

the method further includes: 

the node receiving permission from the master to write the data item to 

persistent storage; and 
after writing the data item to persistent storage, changing the mode from 
global to local. 



1 31. The method of Claim 29 wherein: 

2 the mode associated with the data item is local; and 

3 before the node has completed writing of the data item to persistent storage, the node 

4 transfers the exclusive lock on the data item to another node. 

1 32. The method of Claim 3 1 wherein the node changes the mode from local to global 

2 prior to transferring the exclusive lock on the data item to another node. 

1 33. The method of Claim 3 1 wherein the node informs the master when the node has 

2 completed writing the data item to persistent storage. 
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p 1 30. 

jll 2 

m 3 

p 4 

S 5 



1 


34. 


The method of Claim 3 1 wherein the master informs the other node that the node has 


2 




completed writing the data item to persistent storage in response to the node 


3 




informing the master that the node has completed writing the data item to persistent 


4 




storage. 


1 


35. 


The method of Claim 32 wherein the other node changes the mode from global to 


2 




local in response to a message from the master after the node has informed the master 


3 




that the node has completed writing the data item to persistent storage. 


lis? ^ 


36. 


A method for managing a data item, the method comprising the steps of: 


01 2 




when a data item is transferred from one node to another node, performing the steps 


S 3 




of 


in 4 




if the data item has been dirtied by the node and a mode associated with the 


|y 5 

n 




data item is local, then changing the mode from local to global prior to 


# s 6 




sending the data item to another node; 


ft! 7 




if the data item has not been dirtied by the node and the mode associated with 


8 




the data item is local, then sending the data item to the other node 


9 




without changing the mode; 


10 




allowing the other node to write the data item to persistent storage without 


11 




requesting permission if the mode is local; and 


12 




requiring the other node to obtain permission to write the data item to 


13 




persistent storage if the mode is global. 


1 


37. 


The method of Claim 36 wherein the node transfers the data item to the other node 


2 




prior to completion of the node writing the data item to persistent storage. 
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38. The method of Claim 37 wherein, after completion of the node writing the data item 
to persistent storage, the node sends a message to a master of the data item to indicate 
that the data item has been written to persistent storage. 

39. The method of Claim 38 wherein: 

the other node receives the data item in global mode; and 

the other node sends a request to the master of the node for permission to write the 
data item; and 

the master responds to said request by informing said other node to change said mode 
from global to local. 

40. A method for managing versions of a data item, the method comprising the steps of: 
when a dirty version of a data item is transferred from a first node to a second node 

while a being-written version of the data item is being written to persistent 
storage, performing the steps of: 

communicating version information about the being- written version to the 
second node; and 

based on the version information, the second node preventing any version of 
the data item that belongs to a first set of versions from being merged 
with any version of the data item that belongs to a second set of 
versions; 

wherein the first set of versions includes all versions of the data item within 
the second node that are at least as old as the being-written version; 
and 

wherein the second set of versions includes versions of the data item within 
the second node that are newer than the being- written version. 



50277-1725 



-56- 



1 41. The method of Claim 40 wherein the step of communicating is performed by a master 

2 assigned to said data item. 

1 42. The method of Claim 40 wherein: 

2 the second node includes a plurality of versions in said first set; and 

3 the second node merges said plurality of versions. 

1 43 . The method of Claim 40 further comprising the steps of: 
1^ 2 informing the second node when the being-written version has been successfully 



0 3 written to persistent storage; and 

fj 4 after the second node has been informed that the being-written version has been 
f* 5 successfully written to persistent storage, allowing said second node to discard 

6 all versions in said first set of versions. 



% 1 44. The method of Claim 42 further comprising the steps of 

% 2 informing the second node when the being-written version has been successfully 

m 3 written to persistent storage; and 

4 after the second node has been informed that the being- written version has been 

5 successfully written to persistent storage, allowing said second node to discard 

6 a merged version created by merging said plurality of versions. 

1 45. A method for managing past images of a data item, the method comprising the steps 

2 of: 

3 estimating a likelihood that a first past version of a data item will soon be written to 

4 persistent storage or covered by a write to persistent storage; 
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5 if the estimated likelihood is exceeds a particular threshold, then storing a second past 

6 version of the data item separate from the first past version of the data item; 

7 and 

8 if the estimated likelihood falls below a particular threshold, then merging the second 

9 past version of the data item with the first past version of the data item. 

1 46. The method of Claim 45 wherein the step of estimating is based on a comparison 

2 between a time associated with the first past version of the data item and a time 

3 associated with a recent entry in a redo log file. 

1 47. The method of Claim 45 wherein the step of estimating is based on a comparison 

2 between a time associated with the first past version of the data item and a time 

3 associated with an entry at the head of a checkpoint queue. 

1 48. A computer-readable medium carrying instructions for managing caches in a system 

2 with multiple caches that may contain different copies of a data item, the instructions 

3 comprising instructions for performing the steps of: 

4 modifying the data item in a first node of said multiple caches to create a modified 

5 data item; 

6 sending the modified data item from said first node to a second node of said multiple 

7 caches without durably storing the modified data item from said first node to 

8 persistent storage; 

9 after said modified data item has been sent from said first node to said second node, 

10 said first node sending a request to a master of said data item for writing said 

1 1 data item to persistent storage; and 

12 in response to said request, said master coordinating with said multiple caches to 

1 3 cause said data item to be written to persistent storage. 
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1 49. The computer-readable medium of Claim 48 wherein: 

2 the computer-readable medium includes instructions for performing the step of 

3 maintaining, within an ordered series of bins, entries for past-image versions 

4 of data items; 

5 each bin in said ordered series corresponds to a time range; 

6 a particular bin corresponds to the time range that covers the time at which the data 

7 item is modified in said first node; and 

8 the step of sending a request is performed by sending a request for writing a particular 

9 bin of said ordered series of bins to persistent storage. 

1 50. The computer-readable medium of Claim 49 wherein the step of said master 

2 coordinating with said multiple caches to cause said data item to be written to 

3 persistent storage includes said master causing said multiple caches to write data 

4 items to persistent storage to cover all past image versions of data items that were 

5 modified during the time range of said particular bin. 

1 51. The computer-readable medium of Claim 50 further comprising instructions for 

2 performing the step of emptying said particular bin after said multiple caches write 

3 data items to persistent storage to cover all past image versions of data items that 

4 were modified during the time range of said particular bin. 

1 52. The computer-readable medium of Claim 5 1 wherein the step of emptying said 

2 particular bin includes the steps of: 

3 discarding entries within said particular bin that are associated with past images that 

4 have last-dirtied times within the time range of said particular bin; and 
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moving to one or more other bins the entries within said particular bin that are 

associated with past images that have last-dirtied times later than the time 
range of said particular bin. 

53. The computer-readable medium of Claim 48 wherein the step of sending a request to 
a master is performed by sending the request to a global lock manager. 

54. The computer-readable medium of Claim 48 wherein the step of sending a request to 
a master is performed by sending the request to a lock manager that is one of a 
plurality of lock managers within a distributed lock management system. 

55. The computer-readable medium of Claim 48 further comprising instructions for 
performing the step of sending from the master, to interested nodes, write-notification 
messages indicating that said data item has been written to persistent storage, in 
response to said data item being written to persistent storage. 

56. The computer-readable medium of Claim 55 wherein the step of sending write- 
notification messages includes the master sending to at least one interested node a 
single message that notifies said at least one interested node that a plurality of data 
items have been written to persistent storage. 

57. The computer-readable medium of Claim 48 wherein the step of said first node 
sending a request to a master of said data item for writing said data item to persistent 
storage includes the first node sending to said master a single message that requests 
writing a plurality of data items to persistent storage, wherein said plurality of data 
items includes said data item. 
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1 58. The computer-readable medium of Claim 57 wherein the step of sending a single 

2 message includes sending a message that identifies a bin to request that all data items 

3 that belong to the bin be written to persistent storage. 

1 59. The computer-readable medium of Claim 58 wherein the bin is associated with a 

2 range of time and includes data items that were first dirtied by the first node during 

3 said range of time and that were subsequently transferred to other nodes without first 

4 being written to persistent storage. 

^ 1 60. The computer-readable medium of Claim 55 wherein the step of sending from 

J 1 ; 2 master to interested nodes write-notification messages includes the steps of: 

f % - 3 immediately sending write-notification messages to a first set of interested nodes, 

P 4 where said first set of interested nodes includes the interested nodes that have 

■? 

0 5 requested said data item to be written to persistent storage; and 

$y 

P 6 delaying the sending of write-notification messages to a second set of nodes, where 

|3 7 said second set of nodes includes interested nodes that do not belong to said 

8 first set of interested nodes. 

1 61 . The computer-readable medium of Claim 55 wherein the step of sending from the 

2 master to interested nodes includes delaying the sending of write-notification 

3 messages to at least one interested node. 

1 62. The computer-readable medium of Claim 61 wherein a write-notification message is 

2 sent to the at least one interested node in response to a lock request made by said at 

3 least one interested node. 
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1 63 . The computer-readable medium of Claim 61 wherein a write-notification message is 

2 sent to the at least one interested node in response to the at least one interested node 

3 requesting that said data item be written to persistent storage. 

1 64. The computer-readable medium of Claim 61 wherein a write-notification message is 

2 sent to the at least one interested node within a ping request that the master sends to 

3 the at least one interested node for the at least one interested node to transfer another 

4 data item to another node. 

pi 65 . The computer-readable medium of Claim 48 wherein the step of coordinating 

\& 2 includes the steps of: 

ill 

|V 3 determining whether a version of said data item, that is at least as recent as said 

w\ 

J** 4 modified version, has already been written to persistent storage; and 

H 5 if a version of said data item that is at least as recent as said modified version has 

% 6 already been written to persistent storage, then sending a write-notification 

W 7 message from said master to notify said first node that a version of said data 

Hi 

8 item that is at least as recent as said modified version has already been written 

9 to persistent storage. 

1 66. The computer-readable medium of Claim 65 wherein the step of coordinating 

2 includes, if a version of said data item that is at least as recent as said modified 

3 version has not already been written to persistent storage, then sending a write- 

4 perform message from said master to grant permission for said modified version to be 

5 written to persistent storage. 

1 67. The computer-readable medium of Claim 48 wherein the step of 

2 coordinating includes the steps of: 
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3 selecting a particular node of said multiple caches that has a particular version of said 

4 data item, wherein said particular version is at least as recent as the modified 

5 data item in said first node; and 

6 causing said particular version of said data item to be written from said particular 

7 node to persistent storage. 

1 68. The computer-readable medium of Claim 67 wherein the step of selecting a particular 

2 node includes selecting the node, of said multiple caches, that has a most recently 

3 modified version of said data item. 

1 69. The computer-readable medium of Claim 67 further comprising instructions for 

2 performing the step of the master informing the first node that said data item has been 

3 written to persistent storage in response to the master receiving confirmation that said 

4 particular version of said data item has been written to persistent storage. 

1 70. The computer-readable medium of Claim 67 further comprising instructions for 

2 performing the step of the master informing a set of caches that said data item has 

3 been written to persistent storage in response to the master receiving confirmation 

4 that said particular version of said data item has been written to persistent storage, 

5 wherein said set of caches includes caches, other than said particular node, that 

6 contain modified versions of said data item that are not more recent than said 

7 particular version. 

1 71 . A computer-readable medium carrying instructions for managing caches in a system 

2 with multiple caches that may contain different copies of a data item, the instructions 

3 comprising instructions for performing the steps of: 

4 modifying the data item in a first cache to create a modified data item; 
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in response to writing the modified data item to persistent storage, performing the 
steps of: 

a node associated with the first cache determining whether any other cache in 
said multiple caches had created a dirty version of said data item; and 

if any other cache in said multiple caches had created a dirty version of said 
data item, then the node associated with the first cache informing a 
master of said data item that said modified data item has been written 
to persistent storage; and 

if no other cache in said multiple caches had created a dirty version of said 

data item, then the step of writing the modified data item is performed 
without informing said master that said modified data item has been 
written to persistent storage. 

72. The computer-readable medium of Claim 71 wherein the step of determining whether 
any other cache in said multiple caches had created a dirty version of said data item 
includes inspecting a global dirty flag associated with said data item. 

73. The computer-readable medium of Claim 72 wherein: 

prior to modifying the data item in said first cache, the data item had been modified in 
a second cache; 

the data item is not persistently stored between being modified in said second cache 

and being modified in said first cache; and 
a node associated with the second cache causes said global dirty flag to be set to 

indicate that said data item is globally dirty. 

74. The computer-readable medium of Claim 73 wherein the node associated with the 
second cache causes said global dirty flag to be set in response to transferring said 
data item from said second cache to another cache of said multiple caches. 
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1 75. A computer-readable medium carrying instructions for managing caches in a system 

2 with multiple caches that may contain different copies of a data item, the instructions 

3 comprising instructions for performing the steps of: 

4 modifying the data item in a first cache to create a modified data item; 

5 when a node associated with the first node desires to write said modified data item to 

6 persistent storage, performing the steps of: 

7 if then node associated with the first cache does not currently have ownership 

8 rights to said data item, then the node associated with the first cache 

9 sending a request to a master of said data item for said data item to be 

1 0 written to persistent storage; and 

1 1 if said node associated with said first cache currently has ownership rights to 
p 12 said data item, then the first node writing said data item to persistent 
\_ 13 storage without sending a request to said master for said data item to 
W 14 be written to persistent storage. 



n : 1 75. The computer-readable medium of Claim 74 further comprising instructions for 

2 performing the step of designating holders of exclusive locks in data items to be 

3 owners of said data items. 



1 76. A computer-readable medium carrying instructions for managing a data item, the 

2 instructions comprising instructions for performing the steps of: 

3 when a node that has an exclusive lock on a data item desires to write the data item to 

4 persistent storage, performing the steps of 

5 determining whether a mode associated with the data item is local or global; 

6 if the mode associated with the data item is local, then the node writing the 

7 data item to persistent storage without communicating with a master of 

8 said data item; and 
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if the mode associated with the data item is global, then the node sending a 
message to the master of the data item to request writing of said data 
item to persistent storage. 

77. The computer-readable medium of Claim 76 wherein: 
the mode associated with the data item is global; and 

the computer-readable medium further includes instructions for: 

the node receiving permission from the master to write the data item to 

persistent storage; and 
after writing the data item to persistent storage, changing the mode from 
global to local. 

78. The computer-readable medium of Claim 76 wherein: 
the mode associated with the data item is local; and 

before the node has completed writing of the data item to persistent storage, the node 
transfers the exclusive lock on the data item to another node. 

79. The computer-readable medium of Claim 78 wherein the node changes the mode 
from local to global prior to transferring the exclusive lock on the data item to another 
node. 

80. The computer-readable medium of Claim 78 wherein the node informs the master 
when the node has completed writing the data item to persistent storage. 

8 1 . The computer-readable medium of Claim 78 wherein the master informs the other 
node that the node has completed writing the data item to persistent storage in 
response to the node informing the master that the node has completed writing the 
data item to persistent storage. 
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1 82. The computer-readable medium of Claim 79 wherein the other node changes the 

2 mode from global to local in response to a message from the master after the node has 

3 informed the master that the node has completed writing the data item to persistent 

4 storage. 

1 83 . A computer-readable medium carrying instructions for managing a data item, the 

2 instructions comprising instructions for performing the steps of: 

3 when a data item is transferred from one node to another node, performing the steps 

Jf 4 of 

0 5 if the data item has been dirtied by the node and a mode associated with the 

lift 

M 6 data item is local, then changing the mode from local to global prior to 

jM 7 sending the data item to another node; 

t 8 if the data item has not been dirtied by the node and the mode associated with 

y 9 the data item is local, then sending the data item to the other node 

'% 10 without changing the mode; 

% 1 1 allowing the other node to write the data item to persistent storage without 

12 requesting permission if the mode is local; and 

1 3 requiring the other node to obtain permission to write the data item to 

14 persistent storage if the mode is global. 

1 84. The computer-readable medium of Claim 83 wherein the node transfers the data item 

2 to the other node prior to completion of the node writing the data item to persistent 

3 storage. 

1 85. The computer-readable medium of Claim 84 wherein, after completion of the node 

2 writing the data item to persistent storage, the node sends a message to a master of the 

3 data item to indicate that the data item has been written to persistent storage. 
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86. The computer-readable medium of Claim 85 wherein: 
the other node receives the data item in global mode; and 

the other node sends a request to the master of the node for permission to write the 
data item; and 

the master responds to said request by informing said other node to change said mode 
from global to local. 

87. A computer-readable medium carrying instructions for managing versions of a data 
item, the instructions comprising instructions for performing the steps of: 

when a dirty version of a data item is transferred from a first node to a second node 
while a being-written version of the data item is being written to persistent 
storage, performing the steps of: 

communicating version information about the being-written version to the 
second node; and 

based on the version information, the second node preventing any version of 
the data item that belongs to a first set of versions from being merged 
with any version of the data item that belongs to a second set of 
versions; 

wherein the first set of versions includes all versions of the data item within 
the second node that are at least as old as the being-written version; 
and 

wherein the second set of versions includes versions of the data item within 
the second node that are newer than the being- written version. 

88. The computer-readable medium of Claim 87 wherein the step of communicating is 
performed by a master assigned to said data item. 
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1 89. The computer-readable medium of Claim 87 wherein: 

2 the second node includes a plurality of versions in said first set; and 

3 the second node merges said plurality of versions. 

1 90. The computer-readable medium of Claim 87 further comprising instructions for 

2 performing the steps of: 

3 informing the second node when the being- written version has been successfully 

4 written to persistent storage; and 

U 5 after the second node has been informed that the being-written version has been 

q 6 successfully written to persistent storage, allowing said second node to discard 

|J 7 all versions in said first set of versions. 

13: 1 91 . The computer-readable medium of Claim 89 further comprising instructions for 

O 2 performing the steps of: 

Q 3 informing the second node when the being-written version has been successfully 

h 4 written to persistent storage; and 

5 after the second node has been informed that the being- written version has been 

6 successfully written to persistent storage, allowing said second node to discard 

7 a merged version created by merging said plurality of versions. 

1 92. A computer-readable medium carrying instructions for managing past images of a 

2 data item, the instructions comprising instructions for performing the steps of: 

3 estimating a likelihood that a first past version of a data item will soon be written to 

4 persistent storage or covered by a write to persistent storage; 

5 if the estimated likelihood is exceeds a particular threshold, then storing a second past 

6 version of the data item separate from the first past version of the data item; 

7 and 

-69- 



50277-1725 



if the estimated likelihood falls below a particular threshold, then merging the second 
past version of the data item with the first past version of the data item. 



The computer-readable medium of Claim 92 wherein the step of estimating is based 
on a comparison between a time associated with the first past version of the data item 
and a time associated with a recent entry in a redo log file. 

The computer-readable medium of Claim 92 wherein the step of estimating is based 
on a comparison between a time associated with the first past version of the data item 
and a time associated with an entry at the head of a checkpoint queue. 
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